Closest Strings, Primer Design, and Motif Search
Abstract
Spurious motifs. Our algorithm finds all positions in the strings that meet the specified requirements and reports no positions that do not; this is its advantage over the algorithms in [5] and [3]. However, it clearly cannot avoid finding spurious motifs, e.g., random motifs in artificially generated data that were not implanted. Note that not all combinations of L and d values are reasonable: if d is chosen too large relative to L, we can expect spurious random motifs [3]. For example, with 20 random sequences of length 600, one can expect at least one (15, 5)-motif.
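The expectation quoted above can be checked with a rough back-of-the-envelope estimate. The sketch below is not taken from the paper: it assumes uniformly random i.i.d. sequences over a four-letter alphabet and treats all window positions as independent, which is only an approximation. The function names `occurrence_probability` and `expected_spurious_motifs` are hypothetical and chosen for illustration.

```python
from math import comb


def occurrence_probability(L, d, alphabet_size=4):
    """Probability that a fixed L-mer matches a random window of length L
    with at most d mismatches (assumes uniform i.i.d. letters)."""
    q = 1.0 / alphabet_size  # chance that a single position matches
    return sum(comb(L, i) * (1 - q) ** i * q ** (L - i) for i in range(d + 1))


def expected_spurious_motifs(L, d, num_seqs, seq_len, alphabet_size=4):
    """Approximate expected number of L-mers that occur, with at most d
    mismatches, in every one of num_seqs random sequences of length seq_len.
    Windows are treated as independent, so this is an estimate only."""
    p = occurrence_probability(L, d, alphabet_size)
    windows = seq_len - L + 1
    # Probability that at least one window of a single sequence matches.
    hit_in_one_seq = 1.0 - (1.0 - p) ** windows
    # Sum over all alphabet_size**L candidate motifs (linearity of expectation).
    return alphabet_size ** L * hit_in_one_seq ** num_seqs


if __name__ == "__main__":
    print(expected_spurious_motifs(15, 5, num_seqs=20, seq_len=600))
    print(expected_spurious_motifs(15, 4, num_seqs=20, seq_len=600))
```

Under these assumptions the estimate for (15, 5) comes out above one, while the estimate for (15, 4) is many orders of magnitude below one, which illustrates how sharply the risk of spurious motifs grows with d for a fixed L.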
Similar resources
More Efficient Algorithms for Closest String and Substring Problems
The closest string and substring problems find applications in PCR primer design, genetic probe design, motif finding, and antisense drug design. Because of their importance, the two problems have been extensively studied recently in computational biology. Unfortunately, both problems are NP-complete. Researchers have developed both fixed-parameter algorithms and approximation algorithms for the two pr...
Designing Of Degenerate Primers-Based Polymerase Chain Reaction (PCR) For Amplification Of WD40 Repeat-Containing Proteins Using Local Allignment Search Method
Degenerate primer-based polymerase chain reaction (PCR) is commonly used for isolating unidentified gene sequences in related organisms. To design the degenerate primers, we propose using a local alignment search method to find conserved regions long enough to design an acceptable primer pair. To test this method, a WD40 repeat-containing domain protein from Beauveria bass...
A Closer Look at the Closest String and Closest Substring Problem
Let S be a set of k strings over an alphabet Σ; each string has a length between ℓ and n. The Closest Substring Problem (CSSP) is to find a minimal integer d (and a corresponding string t of length ℓ) such that each string s ∈ S has a substring of length ℓ with Hamming distance at most d to t. We say t is the closest substring to S. For ℓ = n, this problem is known as the Closest String Problem...
Parameterized Intractability of Motif Search Problems (arXiv:cs.CC/0205056v1, 21 May 2002)
We show that Closest Substring, one of the most important problems in the field of biological sequence analysis, is W[1]-hard when parameterized by the number k of input strings (and remains so, even over a binary alphabet). This problem is therefore unlikely to be solvable in time O(f(k) · n^c) for any function f of k and constant c independent of k. The problem can therefore be expected to be i...
Parameterized Intractability of Motif Search Problems
We show that Closest Substring, one of the most important problems in the field of biological sequence analysis, is W[1]-hard when parameterized by the number k of input strings (and remains so, even over a binary alphabet). This problem is therefore unlikely to be solvable in time O(f(k) · n^c) for any function f of k and constant c independent of k. The problem can therefore be expected to be i...
Publication date: 2002